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Abstract. We consider (graph-)group-valued random element ^, discuss the 
properties of a mean-set E(5), and prove the generahzation of the strong law 
of large numbers for graphs and groups. Furthermore, we prove an analogue of 
the classical Chebyshev's inequality for t; and Chernoff-like asymptotic bounds. 
In addition, we prove several results about configurations of mean-sets in 
graphs and discuss computational problems together with methods of com- 
puting mean-sets in practice and propose an algorithm for such computation. 
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1. Introduction 

Random objects with values in groups and graphs are constantly dealt with in 
many areas of mathematics and theoretical computer science. In particular, such 
objects are very important in group-based cryptography (see [23^ or [9^ for introduc- 
tion to the subject). Having the notion of the average for random group elements, 
generalized laws of large numbers for groups with respect to this average together 
with results on the rate of convergence in these laws would broaden the range of 
applications of random group objects from both theoretical and practical point of 
view. With a continuing development of group-based cryptography, availability of 
such tools for analysis of probability measures and their characteristics on groups 
becomes especially important. In this paper, we develop these probabilistic tools 
for finitely generated groups and propose practical algorithms for computing mean 
values (or expectations) of group/graph- valued random elements. The results of 
this paper form a new mathematical framework for group-based cryptography and 
find applications to security analysis of Sibert type authentication protocols (|24j). 

The classical strong law of large numbers (SLLN) states that for independent 
and identically distributed (i.i.d.) real- valued random variables 

n 

(1) -E^^^iE(a) 

i=l 

almost surely (with probability one) as n oo, provided that expectation E(a) is 
finite (see [5], [H], or [30]). It is natural to pose a question about the existence of 
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counterparts of this result for different topological spaces and/or algebraic struc- 
tures, including groups. Starting from the middle of the last century, there has 
been ongoing research, following different trends, concerning the existence of such 
generalizations of the SLLN. One line of this research investigates random walks on 
groups (see Section fl.l.Sl for a brief list of relevant literature sources). The present 
work follows another direction of that research ~ the one which is concerned with 
the problem of averaging in arbitrary metric spaces. We generalize classical prob- 
ability results to groups starting with the concept of expectation (mean value) for 
group elements. Then we prove the almost sure (with probability one) convergence, 
in some appropriate sense, of sample (empirical) means for group/graph random 
elements to the actual (theoretical) mean, thus, generalizing the classical law ([T]) 
and preserving its fundamental idea. We supplement our results with the analogues 
of Chebyshev and Chernoff-like bounds on the rate of convergence in the SLLN for 
random graph/group elements. 

1.1. Historical Background. Below we give a brief account of some developments 
concerning probabilities and mean- values for various spaces as well as some already 
existing generalizations of the strong law of large numbers in order to highlight 
several stages of research that preceded our work. The reader willing to proceed to 
the core of our work right away may skip this section and move on to Section 11.21 

1.1.1. Linear spaces. In 1935, Kolmogorov |22| proposed to study probabilities in 
Banach spaces. Later, the interpretation of stochastic processes as random elements 
in certain function spaces inspired the study of laws of large numbers for random 
variables taking values in linear topological spaces. Banach spaces fit naturally into 
the context of the strong law of large numbers because the average of n elements 
xi, . . . ,Xn in a Banach space is defined as n~^{xi -I- ... -I- a;„). In addition, Banach 
space provides convergence, and Gelfand-Pettis integration provides the notion of 
expectation of a random element (see [H] and [27]). It goes as follows. Let X be 
a linear space with norm || • || : X K and X* is the topological dual oi X. A 
random X-element £, is said to have the expected value E(^) e X if 

E(/(0) = /(E(0) 

for every / S X*. Let be a sequence of random X-elements. Without loss 

of generality, we may assume that E^^ — for every i. The strong law of large 
numbers in a separable Banach space X for a sequence of i.i.d. random X-elements 
is first proved in |25|. It states that 

lim ||n-i(6 + ...+^„)|| =E(a) = 

n— f oo 

with probability one. The strong law of large numbers for i.i.d. random elements 
in a Frechet space was proved in [1]. A few other works discussing generalizations 
of the strong law of large numbers in linear spaces are [2] , [3] , [33] . 

1.1.2. Metric spaces. Unlike in linear spaces, in a general (non-linear) topological 
space X, one has to do find some other ways to introduce the concept of averaging 
and expectation. In 1948, Frechet, [12], proposed to study probability theory in 
general metric spaces and introduced a notion of a mean (sometimes called Frechet 
mean) of a probability measure /x on a complete metric space {X, d) as the minimizer 
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of E(f{x, y), where 

E(fix,y)^ I d^{x,y)fi{dy) 



IX 

when it exists and is unique. If the minimizer is not unique, then the set of mini- 
mizers can be considered. If ^ : £7 — > X on a given probabihty space P) (see 

[1], [H]) is a random element in {X, d) and if for some x S X, 

(2) i?d2(^,x)= inf £;d2(C,2/)<oo, 

y£X 

then X is called an expected element of X. These generalizations were not met with 
much enthusiasm at the time (see historical remarks on probabilities in infinite 
dimensional vector spaces in H^), and Frechet's suggestions, due to the luck of 
their applications, underwent rather slow developments in the middle of the last 
century. 

Let us briefly mention some existing works on generalizing the classical SLLN to 
a metric space X. Let be a sequence of i.i.d. random elements with values 

in X. Let the expectation be defined as in ([2]), written as a set 

E(X) ^{xeX Ed^{i,x) = inf Ed^{C,y) 

Define an empirical mean (average) of elements £,i{uj), . . . ,£,n{<^) to be the set 



M{^,,...,^^)^ {xeX 



i—l i—1 > 



One of the first works on generalization of the SLLN for metric spaces is given in 
1977 by Ziezold ([36]). Ziezold considers a separable quasi- metric space X with a 
finite quasi-metric d. For a sequence of i.i.d. random X-elements such that 

Ed^{£,,x) is finite for at least one x ^ X,he proves that inclusion 



(3) M(c.)=f| [jM(6(c.),...,e„(c.))CE(Ci) 



k—l n—k 



holds with probability one. Here, IJ^fc • ■ • > Cn('^)) is the closure of the 

union of M(^i(cj), . . . ,f„(cj))'s. He also shows that, in general, the equality does 
not hold (for a finite quasi- metric space). In 1981, Sverdrup-Thygeson ([32]) proves 
inclusion ^ for compact connected metric spaces and shows that the equality does 
not hold in general (for a metric space) when the minimizer in ([2]) is not unique. 
In 2003, Bhattacharya and Patrangenaru in [H Theorem 2.3] prove equality in (|3]) 
for the unique minimizer in ([2]) for metric spaces X such that every closed bounded 
subset of X is compact, improving Ziezold's and Sverdrup-Thygeson's results. 

Manifolds. As the need for statistical analysis for spaces with differential geomet- 
ric structure was arising, statistical inference on Riemannian manifolds started to 
develop rapidly, especially due to applications in statistical theory of shapes and 
image analysis. These applications evolve around the concept of averaging. See 
[3T] for an introduction into shape theory. The interested reader may also refer to 
|18) . for instance. 

There are two main approaches to averaging of elements on a manifold. Every 
Riemannian manifold X is a metric space and hence one can use constructions from 
the previous section to define the notion of a mean. Frechet mean of a probability 
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measure on a manifold is also known as an intrinsic mean 4 . Non- uniqueness of 
the intrinsic mean is a source of different technical problems. Also, the intrinsic 
mean, even when unique, is often very difficult to compute in practice. 

On the other hand, a manifold X can also be looked at as a submanifold of 
some Euclidean space M'^ and one can define a mean relative to this inclusion. Let 
T : X — >■ K*^ be an embedding of X into Euclidean space (R*"', do). A point p e R*^ is 
called nonfocal if there exists a unique x e t{X) such that do{p,x) = do{p,T{X)). 
Let /i be a probability measure on X, fi' a. probability measure on R*^ induced by 
T, and x* e M'' the expectation of /i'. We say that the measure ^ is nonfocal if 
X* is a nonfocal point. For a nonfocal probability measure fi on X we define the 
mean as t^^{x*). In [4] the authors prove the strong law of large numbers for the 
intrinsic and extrinsic means on manifolds. 

1.1.3. K-means. A notion of a mean (or a mean-set) can be generalized into fc-mean. 
Let -B be a Banach space with a norm || • || and fc G N. For a set i7 = {hi, . . . , h^} 
we define a partition of B as follows 



S, = {xeB\ \\x- h,\\ < \\x - hjW for every j = 1, . . . , fc} \ (S*! U . . . U Si-i) 
where i ~ 1, . . . , fc and a function tth '■ B B 



where Is- is the indicator function of Si. Fix a suitable non-decreasing function 
$ : R+ R+ (e.g., ^{x) — x or $(x) — x^) and for a probability measure /i on B 
define a number 



A set Hq of k elements that minimizes the value of M is called a k-mean of a 
probability distribution fi. In general, there can be several minimizers, which leads 
to technical complications. In 1988, Cuesta and Matran (j8:) proved that empirical 
fc-means converge to the fc-mean Hq of fi under the assumption that the fc-mean is 
unique. 

It is straightforward to generalize a notion of a fc-mean to a general metric space 
{X,d). Indeed, if we put 



S,^{x(EB\ d{x, hi) < d{x, hj) for every j = 1, . . . , fc} \ (Si U . . . U S^-i) 



then we get a similar notion. This type of fc-means, with $(a;) = x, was considered 
by Rubinshtein in 1995 in [53] where it was called the fc-center. 

As we can see, in general, depending on the research goals, one can define mean 
values on a given metric space {X, d) using any powers of d, i.e., instead of dealing 
with minimization of Ed'^{^,x) in ([2]), one can work with a very similar functional 
by minimizing Ed^{^, x) for any r > if necessary. 



k 





and 
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1.1.4. Probabilities on algebraic structures. Metrics and probabilities on algebraic 
structures have been studied Irom different perspectives. One source to look at is 
the book of M. Gromov [17; . The reader can find some applications of Frechet mean 
in statistical analysis of partially ranked data (such as elements of symmetric groups 
and homogeneous spaces) in the book of Diaconis ([10]). An extensive historical 
background of the studies of probabilities on algebraic structures is given in [16) . 
where the author considers probabilities for stochastic semi-groups, compact and 
commutative stochastic groups, stochastic Lie groups, and locally compact stochas- 
tic groups employing the techniques of Fourier analysis to obtain limit theorems 
for convolutions of probability distributions. The reader interested in the question 
of defining probabilities on groups can find several approaches to this issue in [6]. 

1.1.5. Random walks on groups. One way to generalize the strong law of large 
numbers for groups is to study the asymptotic behavior of the products gig2 ■ ■ ■ gn, 
where {gi}fZi is a- sequence of i.i.d. random group elements, the so-called random 
walk on a group. The reader can consult |35| for an introduction to random walks 
on groups. In 1960, Furstenberg and Kesten ([14^) prove the generalization of the 
strong law of large numbers for random matrices. Namely, they show that the limit 

l0g||ffl52---gn|| 

hm 

exists with probability one, with some restrictive conditions on the entries of gi, 
without computing the limit explicitly. In 1963, Furstenberg solved this problem 
for normalized products of random matrices in terms of stationary measures ([13j). 
Computational techniques that would allow to compute these measures are investi- 
gated in |28j . A concise account of a number of illuminating results in the direction 
of the generalization of the SLLN to groups can be found in [20] , where the authors 
prove the theorem about the directional distribution of the product gig2 ■ . ■ gn, 
thus, proving a general law of large numbers for random walks on general groups. 
The authors call it a multiplicative ergodic theorem or a general, noncommutative 
law of large numbers (see [20] for the precise statement). 

1.2. The core of our work. Motivated by applications to group-based crypt- 
analysis, we study Frechet type mean values and their properties in graph/group 
theoretic settings. 

Let r = {V{T), E{r)) be a locally finite graph and {il,J^,P) a given probability 
space. A random F-element ^ is a measurable function ^ : ^(r). This random 
F-element ^ induces an atomic probability measure fi : V{T) — > [0, 1] on V{T) in a 
usual way: 

^i{v) = P({a; e n I ^{uj) = w}), V £ V{r). 
Next, we introduce a weight function : V^(r) — > R by 

M^{v)^Ed\v,0= E d\v,s)fi{s), 
sev{r) 

where d{v, s) is the distance between v and s in F, and note that, trivially, the 
domain of definition of M^{-) is either the whole V(r) (in which case we say that 
M is totally defined) or 0. The domain of M is the set 



domain{M) = < w £ ^(L) 



E d^{v,s)^.^{s) < CO 
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In the case when domain{M^) ~ V{T), we define the mean-set of to be 

(4) E(e) = {ve ViT) I M^{v) < M^{u), Vit G V{T)}. 

The above definition of E(<^), ^ : 51 — ^ ^(r), provides the corresponding notion of a 
mean (average, expectation) for finitely generated groups via their Cayley graphs. 

Once we have the notion of mean-set for group-valued random elements, we 
notice that it satisfies the so-called "shift" property; namely, 

(5) E(50-5E(0,V<?eG 

which is analogous to the linearity property of a classical expectation for real- valued 
random variables. 

Next, for a sample (w ),..., ^„(a;) of i.i.d. random F-elements we define a 
relative frequency fj,n{u;u!) — ^n{u) with which the value u G V'(r) occurs in the 
sample above: 

/^„(u) u)\. 
n 

Relative frequency /i„ is a probability measure on F, and we can define empirical 
(sampling) weight function (random weight) as 

Mniv)^ J2 d^{v,s) finis). 

sev{r) 

Going further, we define an empirical mean or sample mean-set of the sample 
^1 , . . . , to be the set of vertices 

• • • ,60 = {ve ViT) I Mniv) < M„(u), Vu e ^(r)}. 

The function S(^i, . . . , f„) on graphs is an analogue of the average function (xi, . . . , a;„) 
(xi -I- . . .-\-Xn)/n for xi, . . . , x„ G R. We let S„ = §(■?!, • . • , £,n)- With these notions 
at hand, we first formulate and prove the following generalization of the strong law 
of large numbers for graphs and groups with one-point mean sets. 

Theorem A. (Strong Law of Large Numbers for graphs ) Let T be a locally- 
finite connected graph and o. sequence of i.i.d. random T-elements. If the 
weight function Af^j(-) is totally defined and E(^i) = {v} for some v G V{T), then 

lim S(ei,...,C»)=E(ei) 

n—^oc 

with probability one. 

Next, we improve this result and prove the generalized law of large numbers 
for groups for the case when |E^| > 1 (see Section [OT) . The simplest version of 
multi-vertex SLLN in terms of limsup is as follows: 

Theorem B. (Multi- vertex SLLN for graphs ) Let T be a locally-finite con- 
nected graph and {6}^i a sequence of i.i.d. random T-elements. Assume that the 
weight function M is totally defined and E(^) — {vi, . . . , Vk}, where fc > 4. // the 

random walk associated to vi is genuinely (fc — 1)- dimensional, then 

lim sup §„ = E(^i) 

holds with probability one. 

In addition, we prove analogues of classical Chebyshev's inequality and Chernoff- 
like bounds for a graph- (group-) random element ^. 
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Theorem C. (Chebyshev's inequality for graphs) Let T be a locally-finite 
connected graph and {^i}fZi a sequence of i.i.d. random T-elements. If the weight 
function Afjj(-) is totally defined then there exists a constant C — C(r,^i) > 
such that 



C 



Theorem D. (ChernofT-like bounds for graphs) Let T be a locally-finite con- 
nected graph and {S,i}fZi sequence of i.i.d. random V -elements. If the weight 
function M^^ totally defined and /i^^ has finite support, then for some constant 
C > 

P(siii,...,U^m)) <0(e^^"). 



1.3. Outhne. In Section [21 we give basic definitions and discuss some properties 
of the newly defined objects. Next, we turn to tfie formulation and the proof of 
the strong law of large numbers on graphs and groups. These tasks are carried 
out in Section [3] Chebyshev's inequality and ChernofF-like bounds for graphs are 
proved in Section U) In Section [51 we consider configurations of mean-sets in graphs 
and their applications to trees and free groups. Section [HI deals with computational 
problems and methods of computing E(^). In particular, we propose an algorithm 
and prove that this algorithm finds a central point for trees. Finally, in Section 
[7] we perform series of experiments in which we compute the sample mean-sets of 
randomly generated samples of n random elements and observe the convergence of 
the sample mean-set to the actual mean. 

2. Mean (expectation) of a group-valued random element 

Let (fl, T , P) be a given probability space and G a finitely generated group. In 
this section, we define the notion of expectation for graph random elements in the 
sense of Frechet mean set, which is one of the possible ways to look at mean values 
(see Section ri.l.2|) . The same definition will hold for random elements ^ : G 
on groups. We also discuss properties of the mean sets on groups. In particular, 
we prove that for our expectation E, we have E(g^) = 5E(^). 

2.1. The mean set in a graph. Let F = (V^(F), i?(r)) be a locally finite connected 
graph. A random F-element ^ is a measurable function ^ : f2 Y{V\ The random 
element ^ induces an atomic probability measure /i : V^(r) — > [0, 1] on V{T) in a 
usual way: 

(6) = Mc(«) = P({^ e ^ I e(^) = «}), « e F(F). 

Next, we introduce a weight function : V{T) — ?> R by 

M^{v)=Ed^v,0= J2 d^{^,s)fi{s), 
sev{r) 

where d{v,s) is the distance between v and s in F. If M^{v) is finite, then we say 
that the weight function is defined at v. The domain of M is the set 



domain{M) — Iv ^(F) 



^ d^(u, s)^^(s) < oo 
s6V(r) 
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The case of interest of course, is when M^(t;) is totally defined, meaning that M^(w) 
is finite at every v G V^(r). 

Definition 2.1. Let ^ be a random F-element such that M^(-) is totally defined. 
The set of vertices u e F that minimize the value of 

(7) E(e) = {v£ v{r) I M^{v) < M^(u), Vu e v{r)}, 

is called the mean-set (or the center-set, or average) of ^. 

Very often we leave the random element ^ in the background to shorten the 
notation and write M{v) instead of M^{v). Moreover, we write instead of 

E(^) sometimes and speak of the mean set of distribution fi induced by ^ on V{T). 

Lemma 2.2. Let T be a connected graph, ^ a random T-element, and u, v adjacent 
vertices in F. If M{u) < oo, then M(v) < oo. 

Proof. Easily follows from the definition of M and the triangle inequality. □ 

Corollary 2.3. Let F be a connected graph and ^ a random F-element. Then 
either domain(M) ~ V{T) or domain(M) = 0. 

Lemma 2.4. Let T be a connected locally finite graph and ^ a random T-element. 
If is totally defined, then < |IE(C)| < oo- 

Proof. Let /i be a measure of ^ induced on F by ^. For an arbitrary but fixed 
vertex v gT, the weight function 

oo / 

iev{r) n=o y iev{r).d{v.i)=n 

is defined at v by assumption. Choose r G N such that 



lM{v)<j2\n^ 

n=0 \ 



ieV{r),d{v,i)=n I ieB^(r) 



where 

(8) B,{r) = {ieV{V)\d(v,i)<r} 

is the ball in F of radius r centered at w. If we take a vertex u such that d{u, v) > 3r, 
then using the triangle inequality, we obtain the following lower bound: 

M{u)= E d^iu,i)t^ii) > E E d'^{u,i)li{i) > 4: E , i) K^) > (v) . 

ieV(r) ieB^(r) i^B^(r) ieB^(r) 

Thus, d{v,u) > 3r implies u E(^) and, hence, E(^) C By{3r). Since the graph 
F is locally finite, it follows that the sets By{3r) and E(^) are finite. This implies 
that the function AI attains its minimal value in By{3r) and hence E(^) 7^ 0. □ 

2.2. The mean set in a group. Let G be a group and X C G a, finite generating 
set for G. The choice of X naturally determines a distance dx on G via its Cayley 
graph Gg{X). Hence Definition 12. II gives us a notion of a mean set for a random 
G-element. It follows from the definition of the distance dx that for any a,b,g G G 
the equality 

(9) dx{a,b) ^ dx{ga,gb) 
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holds. This equality implies that E(^) possesses the desirable property ^{gS,) = 
.gE(^), as the following proposition shows. 

Proposition 2.5 ("Shift" Property). Let G be a group and g E G. Suppose that 
(r2, jF, P) is a given probability space and ^ : Vt ^ G a G-valued random element 
on f2. Then for the random element £,g defined by S,g{uj) = gS,{uj) we have ]E(Cg) = 

gm)- 

Proof. Let fi^^ be the measure induced on G by ^g, in the manner of ([6|). It follows 
from the definition of that for any h Cz G 

fi^^ih) = Pi{u; I Cgiu;) = h}) - Pi{u; \ g^^) = h}) = P({c. | ^(c.) = g'^h}) = fi^ig-'h). 
This, together with implies that for any h E G 

M^^ih) = ^Y.'^\g-'Kg~^^)^l^{g-^^) ^J2d'i9''h,i)fidi) = M^{g~'h). 

i£G ieG ieG 

Hence, the equality (h) — M^{g^^h) holds for any random element ^ and g,h E 
G. Therefore, for any hi,h2 € G, A%(/ii) < M^,ih2) ^ M^{g-^hi) < M^{g-'^h2) 
and 

= [heG\ M^^{h) < M^^if), V/ e g} = {/i e G I AMg-'h) < M^{g-'f), V/ e g} = 

^{heG\ K'k{g-^h) < Me(/), V/ e g} = {gh e G \ Kk{h) < M^{f), V/ G g} = <?E(e). 

□ 

The equality dx{a, b) = dx{ag, bg) does not hold for a general group G = (X). 
It holds for abelian groups. 

Proposition 2.6. Let G be an abelian group and g G G. Suppose that {fl,J-,P) is 
a probability space and ^ : —> G a G-valued random element on f2. Then for the 
random element S^g defined by £,g{oj) = £,i^)g we have E(^g) = (E(^))g. 

2.3. Other possible definitions of E. There are other possible definitions of E 
for which the statement of Proposition 12.51 (and other results of Section [3]) holds. 
Let c be a positive integer. By analogy to the function M^(w), define a weight 
function M^'^\v) of class c by 

iev{T) 

and the mean-set E*^'^)(^) of class c to be 

£('='(0 ^{v£ V{T) I M^^^v) < M^^^u), Vu e V^(r)}. 

It is straightforward to check that all the statements of the previous section hold 
for M^''\-) and E('^)(^). In fact, it is not hard to see that when c = 1, we have a 
counterpart of the median of the distribution fj,. Next proposition shows that our 
E agrees with the classical definition of the expectation on Z in the following sense. 

Proposition 2.7. Let^ : U, ^ "L be an integer-valued random variable with classical 
expectation m ~ X^nez '^■'^(^ ~ Assume that M = is defined on Z. Then 

1 < |E(2)^| < 2 and for any v e E(2)(0; we have |m-i;| < i. 

Proof. Straightforward. □ 
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Remark 2.8. Observe that E^^^ does not coincide with the classical mean in M^. 
Recall that the classical mean in is defined coordinate- wise, i.e., the mean of 
. . . , (x„,?/„) is a point in defined by (EX, EF). For example, consider 
the distribution on 1? such that /x(0, 0) — /i(0, 3) = /i(3, 0) = 1/3 and for all other 
points /i = 0. Then the classical mean is the point (1, 1), and the mean-set E^^^^ is 
the point (0,0). 

3. Strong Law of Large Numbers 

Let ^i,...,^„ be a sample of independent and identically distributed (i.i.d.) 
graph-valued random elements : f2 — l^(r) defined on a given probability space 
(r2,7^, P). For every £ $7, let /i„(u; w) be the relative frequency 

(10) fi„[u;uj)^ 

n 

with which the value u G V(T) occurs in the random sample Cii'-^)^ • ■ • i fn(w). We 
shall suppress the argument a; e il to ease notation, and let 

iev{r) 

be the sampling weight, corresponding to f G V(T), and M„(-) the resulting sam- 
pling weight function. 

Definition 3.1. The set of vertices 

§„ = §(^1, ...An) = {ve V{r) I Mn(v) < Mn{u), Vu e V{t)} 

is called the sample mean-set (or sample center-set, or average) of the vertices 

fl7 • • • 7 6i- 

Lemma 3.2. Let T be a locally-finite connected graph, v € V(T), and 
sequence of i.i.d. random T-elements such that Mjj(u) is defined. Then 

(11) p(^Mn{v) M{v) as n oo) = 1. 

Proof. Foreveryu G V{T), M(v) is the expectation of the random variable d^(u, ^i). 
The result follows by the strong law of large numbers applied to {d^{v, ^i)}^i. □ 



It is important to notice that in general the convergence in Lemma 13.21 is not 
uniform in a sense that, for some distribution on a locally finite (infinite) graph 
F and some e > 0, it is possible that 

V{3N s.t. Vn > TV Vi; e ^(F), |M„(u) - M{v)\ < < 1. 

In other words, the convergence for every vertex, as in Lemma 13. 2[ is insufficient 
to prove the strong law of large numbers, stated in introduction. Next lemma is a 
key tool in the proof of our strong law of large numbers. 

Lemma 3.3 (Separation Lemma). Let T he a locally-finite connected graph and 
sequence of i.i.d. random T-elements. If the weight function il/^j(-) is 
totally defined, then 



p(3N s.t. \/n> N, max Af„(w) < inf MJi 
\ i'GE(5i) «ey(r)\E(5i) 



= 1. 
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Proof. Our goal is to prove that for some S > 

(12) P(3N Vn > iV Vf e E(a), Vm G V^(r) \E(^i), M„(u) - M„(w) > (j) = 1. 

We prove the formula above in two stages. In the first stage we show that for some 
fixed vo G E(^i) and for sufficiently large number m > the following holds 
(13) 

P(3N s.t. Vn > iV Vi; G E(a), Vu G ^(r) \ By„{m), Mn{u) - M„{v) > = 1 

in the notation of (|8]). In the second stage we prove that 
(14) 

P(3N s.t. Vn > AT Vf G E(a), Vit G B„„(m) \ E{^i), Af„(u) - M„(w) > (5^ = 1 

Having the formulae above proved we immediately deduce that (jl2l) holds using 
(T-additivity of measure. 

Let Vq G E(^i) and /it be the probability measure on F induced by ^i, as in ([51). 
Since the weight function M(-) is defined at vq, we can choose r G M as in Lemma 
12.41 such that ^AI{vo) < X]ie-B„o(r) ^^("o, «)m(*)- Put m — 3r. In Lemma [Ol we 
proved that, if a vertex u is such that d{u, vq) > 3r, then 

(15) M{u)= (f{u,i)^i{i)>4: (f{u,i)n{i)>2M{vo). 

It implies that E(^i) C By^{3r). 

Since T is locally finite, the set By„{r) of ([8|) is finite. We also know from the 
SLLN for the relative frequencies /z„(u) that /i„(u) n{u) as n -H- oo. These facts 
imply that for any e > 0, the event 

(16) C, = {3N = N{e), Vti > iV, Vm G S,„(r), |;U„(u) - /i(u)| < e} 

has probability one. In particular, this is true for e = e* = j min{/i(u) | u G 
Byg{r), /i(u) ^ 0}, and the event C^. is a subset of 

(17) I^A^ = N{e*), Vn > iV, Vm G V^(r) \ B,„(3r), M„(m) > ^Af («o)|- 

Indeed, on the event C^. , as in p^ . we have ^„(i) > |m(*)j * £ Byg{r). Using this 
fact together with p5|) . we can write 

M„(u)= ^ d'(u,i)M«(«) > 4 ^ -^^^^-^ > 3 d^{u,i)^l{i)>^M{vQ). 

Thus we have 

(18) p(3N s.t. Vn > TV, Vu G ^(r) \ B,„(3r), A/„(m) > ^M(t;o)^ = 1. 

By Lemma 1321 for any v G V{T) and any e > 0, we have 

p(3iV = 7V(e), Vn > A^, |M„(w) - M{v)\ < = 1 

and, since Byg{3r) is a finite set, we have simultaneous convergence for all vertices 
in By„{3r), i.e., 

(19) P(3N = N{e), Vn > A^, Vv G By„{3r), |M„(w) - M{v)\ < e'^ = 1. 
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In particular, remembering that E(^i) C Bvg{3r), for e = M{vo)/4, 
(20) p(^3N = N{e), Vn > iV, Vw £ E(a), Jm(w) < M„(w) < ^M(t;)^ = 1. 
Finally, we notice that on the intersection of the events in ^TE\\ and (PO)) , we have 
M„(m) - Mn{v) > ^M{v) - h'I{v) = h'I{v) = iM(i;o), 

by the virtue of the fact that M{vq) — M{v) (as both vq,v e E(a)), and formula 
((T3l) holds for any 5 such that (5 < iM(uo). 

For the second part of our proof we use statement ([TO]) that holds, in particular, 

for 

e = e' = imin{M(M) - M{vo) \ u e S„o(3r), M(m) - Af(wo) > 0}. 

It means that, with probability 1, there exists = iV(e') such that for any n > N 
andallu e Byg{3r), we have |M„(u)— M(u)| < e'. Moreover, since E (a) C Byg{3r), 
we can assert the same for any v € E(a); namely, \Mn{v) — M{v)\ < e' . Together 
with the fact that M(u) — M[vq) > 0, the obtained inequalities imply that, with 
probability 1, there exists number N ~ ^{e') such that for any n > N and all 
weB„o(3r)\E(a), 

M„{vo) < M{vq) + e' < M{vo) + \{M{u) - M{vo)) 
M{u) - ^(M(u) - M{vq)) < M{u) -e' < Mr,{u), 

and, hence, 

M„(u) - A/„(«o) > M{u) - ^{M{u) - M{vo)) - M{vo) - ^{M{u) - M{vo)) = 

= i(Af(ti)-M(«o))>2e', i.e., 

P(3N - Nie), Vn > N, Vu e B„„ (3r) \ E(ei) : M„(u) - M„(wo) > 2e') 

Therefore, holds for any 6 < 2e'. Choosing 6 = min(-iM(z;o), 2e') finishes the 
proof. 

□ 

Corollary 3.4 (Inclusion Lemma). Let F be a locally-finite connected graph, 
{£,i}fZi a sequence of i.i.d. random F-elements, and fi — /i^^. Suppose that the 
weight function Afj(-) is totally defined. Then 



1. 



P limsup§(a,...,en) CE(ei) = 1. 

\ n— >oo / 

Proof. Lemma 13.31 implies that p(^u ^ limsup§„, for every u G V^(F) \ E{^i) 
1. □ 

Theorem 3.5. (SLLN for graph-valued random elements with a singleton mean- 
set.) Let T be a locally-finite connected graph and {Ci}iZi sequence of i.i.d. ran- 
dom T-elements. If the weight function Mjj(-) is totally defined and E(a) = {v} 
for some v G V{T), then 

lim S(a,...,a)=IE(ei) 
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almost surely (with probability one). 

Proof. p(3N s.t. Vn > N, M„(w) < inf„gv'(r)\{i,} Mn{u)j — 1, bv Lenima l3.31 and, 

hence, p(3N s.t. Vn > N, §(^i,...,C„) = {«}) ==1. □ 

3.1. Case of multi-vertex mean-sets. In this section we investigate a multi- 
vertex mean-set case and conditions under which the strong law of large numbers 
holds for such set. We reduce this problem to the question of recurrence of a 
certain subset in Z" relative to a random walk on this integer lattice. If 2 < 
< 3, no restrictive assumptions are required; we formulate and prove the law 
for these special instances separately. The case |E(^)| > 3 requires more technical 
assumptions, and, thus, more work to handle it. 

3.1.1. Preliminaries. Assume E(^i) = {vi,V2, ■ . ■ ,Vk}. Our goal is to find condi- 
tions that would guarantee the inclusion E(^i) C lim sup„_jg^ S„ or, without loss 
of generality, conditions for vi G limsup„_^oQ §„. 

By Lemma 13.31 it follows that, with probability one, for a sequence of random 
F-elements there exists a number N such that for any n > N we have 

max{M„ {vi ) , M„ (wa ) , . . . , M„ (wfe ) } < inf M„ (u). 

uer\{vi,v2....,Vk} 

Hence, for any n > N , vi Q Sn if and only if Af„(wi) < M„(ui) for every i = 2, . . . , k. 
Thus, to achieve our goal, we need to show that the system of inequalities 

Mn{v2) - Mn{vi) > 0, 
Mn{vk) - Mn{vi) > 0, 

is satisfied for infinitely many n G N. 
For i = 1, . . . , k — I and n G N, define 

R,in) = n{Mniv,+i) - M„(?;i)) = ^ {d^in,+i, s) - d?{vi, s))-\{i \ £.^ = s,l < i < n}\ 
and observe that 

(21) R,{n + l)-R,{n)^Y. [d\v,+^, s) - d\vi, s)] l{c„^,=.}. 

ser 

i.e., every Ri{n) represents a random walk on Z starting at 0. Consider a random 
walk i?, associated with ui, in Z'^"^, starting at the origin (0,...,0) with the 
position of the walk after n steps given by 

R{n) = {Ri{n), i?2(n), . . . , i?fc_i(n)). 

An increment step for R is defined by a vector ({s) — {(i{s), . . . , (k-i{s)), s G V{r), 
with probability /i(s), where 

Ci(s) = d'^{vi+i,s) - d^ivi^s). 

The following lemma shows the significance of this random walk. 

Lemma 3.6. In the notation of this section, vi G limsup„_^oQ §„ if and only if 
the random walk R visits the set Z_^r^ — {(ai, ■ • • ,aA;-i) | > 0} infinitely often. 
Therefore, 

P{vi G limsupS„) = P(R(n) G Z^"-\ i.o.). 
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Proof. Follows from the discussion preceding the lemma. □ 

It is worth redefining R in the terms of transition probability function, as in 
[?T] . Let G Z*^^^ be the zero vector and Xi — Q{s), s E ^(r). For every 
X = {xi, . . . G Z'^'^^, we define a function P{0,x) by 

(22) P{0,x) — fj,{s I Xi — d^{vi+i, s) — (f{vi,s) for every i ~ 1, . . . ,k — 1}. 

It is trivial to check that this is, indeed, the transition probability for R. To continue 
further, we investigate some properties of our random walk R. 

Lemma 3.7. Let R be a random walk defined above. Then 

mi— ^ xP{0,x)—0 and m2 — ^ \xfP{0,x) < oo. 

Proof. The first equality trivially holds. For the second one, we get 

\x\'p{o,x)= J2 

fe-l 2 

< ^ (^d{vi,s) +d{vr+i,s)j fj.{s) 

i=i sev{r) 
fc-i 

<J2d^i'"i''"^+i)i'^M{vi)+AM{v,+i)) < oo. 

i=l 

□ 

Clearly, conditions under which this random walk is recurrent would guarantee 
that vi C limsup„^oQ S„ (see [31] page 30, Proposition 3.3]). A general (not simple, 
not symmetric) one-dimensional random walk is recurrent if its first moment is zero 
and its first absolute moment is finite (see [31], pg. 23). Sufficient conditions for 
the recurrence of two-dimensional random walk involve the finiteness of its second 
moment and can be found in |31[ page 83]. The result stated there indicates that 
genuinely 2-dimensional random walk is recurrent if its first moment is zero, and 
its second moment is finite. Let us recall some important notions before we go on. 

Consider an arbitrary random walk R on Z" given by a transition probability 
P, as in ([22]) . The support, supp{P), of the probability measure P is defined to be 
the set supp{P) — {v E Z" | P{v) ^ 0} of all possible one-step increments of R. 
Further, with R, one can associate an abelian subgroup A-^ of Z" generated by the 
vectors in supp{P). It is well-known in group theory that any subgroup Aj^ of Z" 
is isomorphic to Z*^, where k <n (the reader can also check [31j Proposition?. 1 on 
pg.65] for details), in which case we write dim(^-^) = k and say that R is genuinely 
k- dimensional. Let us stress that we speak of an n-dimensional random walk on 
Z" when P(0,x) is defined for all x in Z"; this walk is genuinely n-dimensional if 
dim(yl;j=r) — n. We say that R is aperiodic if A-j^ = Z". Observe that genuinely n- 
dimensional random walk does not have to be aperiodic. A standard simple random 
walk, which we denote hy S = S{n), is an example of an aperiodic random walk 
on Z". It will be convenient to define a vector space Vj^ C M" spanned by the 
vectors in supp{P). It is easy to see that the genuine dimension of R is equal to 
the dimension of V-^. We shall need another notion for our developments. Assume 
that Z? is an fc X n matrix (not necessarily integer valued) which maps A-^ onto Z*^. 
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Then D naturally induces a random walk R on Z with transition probability P 
given by P°{u) = P(iJ e Z" \ D(j]) = u) for every u elJ'. 

3.1.2. Strong law of large numbers for two or three vertices mean-sets. Now, we 
can easily prove our strong law of large numbers for mean-sets with two or three 
elements. 

Theorem 3.8 (SLLN for graph random elements with two or three point mean-set). 

Let T be a locally-finite connected graph and be a sequence of i.i.d. random 

T-elements. If the weight function Afjj(-) is totally defined and 2 < |E(^)| < 3, 
then 

limsup§„ = lE(^i) 

holds with probability one. 

Proof. Assume that E(^) = {vi,V2}. Then the random walk R is one-dimensional. 
It is recurrent if 

ICi(s)Im(s) < oo and ^Ci(s)Ai(s) = 
(see [2T], pg. 23). The equality M{vi) — M{v2) implies the second conditions and 

E ICi i^M^) ^J2\d'{v2,s)~ («i , s) Hs) 

ser ser 

< Y{d^{v2,s)-\-d^{vi,s))fi{s) = M{vi)-\-M{v2) < oo 
ser 

implies the first condition. Hence, R is recurrent, and takes on positive and negative 
values infinitely often. We conclude that almost always limsupS„ = {vi,V2} = EC- 

n— f oo 

Assume that E(C) ~ {vi,V2,v^}. Then the random walk R can be genuinely 0, 
1, or 2-dimensional. The first case is trivial, the second can be considered as the 
case when |E(C)| = 2. So, assume R is genuinely 2-dimensional. By Lemma 13.71 
the first moment of R is (0, 0) and the second moment is finite. Now, it follows 
from [3TJ Theorem 8.1] that R is recurrent. In particular, Z^^^ is visited infinitely 
often with probability 1. 

In both cases, it follows from Lemma 13.61 that P(ui 6 limsup^^^o ^n) — 1- 
Hence the result. □ 

Recall that a subset of Z" is called recurrent if it is visited by a given random 
walk infinitely often with probability one, and it is transient otherwise. A criterion 
for recurrence of a set for a simple random walk was obtained in |19j for rt = 3 (it 
can also be found in |3H Theorem 26.1]). It turns out that the criterion does not 
depend on a random walk in question. This is the subject of the extension of the 
Wiener's test, proved in |34|, that we state below. This invariance principle is one 
of the main tools we use in our investigation of the recurrence properties of the 
positive octant in Z" for R. 

Theorem. (Extension of Wiener's test, [34]) Let n > 3. An infinite subset A of 
Z" is either recurrent for each aperiodic random walk R on Z" with mean zero and 
a finite variance, or transient for each of such random walks. 
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For a positive constant a S M and a positive integer m < n define a subset of M" 

Cone™ = . . e M" I xi = 0, . . . , a;„_„ = 0, \J x'^^_„^^^ + . . . + x'^^_^ < ax, 

called an m-dimensional cone in M". If m = n, then we omit the superscript in 
Cone™. For an n x n matrix D and a set ^ C K", define a set — {D-v \ v G A}, 
which is a linear transformation of A. If D is an orthogonal matrix, then the set 
(ConCct)^ is called a rotated cone. As in |19) , for any non-decreasing function 
i : N — > R+ define a set 

Thorrii = {tJ G Z" | vf + . . . + vl_-^ < i{vn)}. 

Observe that Cone^ H Z" = Thorrii where = at. 

Theorem 3.9. For any a > and any orthogonal matrix D, 

P(S'(n) G (Cone„)^, i.o) = 1, 

i.e., </ie probability that the simple random walk on Z" visits (Conea)^ infinitely 
often is 1. 

Proof. Direct consequence of (6.1) and (4.3) in [19], where the criterion for recur- 
rence of Thorui is given. □ 

Next two lemmas are obvious 

Lemma 3.10. Assume that a set A C M" contains a rotated cone. Then for any 
invertihle n x n matrix D, the set A^ contains a rotated cone. 

Lemma 3.11. If Si C 5*2 C M" and Si is visited by the simple random walk infin- 
itely often with probability 1 then S2 is visited by the simple random walk infinitely 
often with probability 1 . 

Now, we return to our strong law of large numbers for multi-vertex mean-sets. 
Assume that E<^ — {vi, . . . ,Vk}, where fc > 4. Let i?* be a random walk on Z*^"^, 
associated with Vi, where i = 1, . . . ,k (in our notation, R — R ). This is a (fc — 1)- 
dimensional random walk which, in general, is not aperiodic. In fact, R is not even 
genuinely (fc — l)-dimensional. Fortunately, it turns out that it does not matter to 
what vertex Vi we associate our random walk, since the choice of the vertex does 
not affect the dimension of the corresponding walk, as the following lemma shows. 

Lemma 3.12. Let ^ be a probability measure on a locally finite graph T such that 

I 

E/i = where k > 2. Then the random walks R , associated 

with vertices wi, . . . respectively, all have the same genuine dimension. 

Proof. We prove that random walks R^ and R^ have the same genuine dimen- 
sion. Recall that the subgroup A-j^i is generated by the set of vectors G Z'^~^ 
such that for some s G supp{ii), — v^{s) = {d'^{v2,s) — d'^{vi,s),d'^{v3,s) — 
d'^{vi, s), . . . , d'^{vk, s) — rf^(wi, s)) and the subgroup is generated by the set 
of vectors G such that for some s G supp(p), Ip' = v'^{s) = {d'^{vi,s) — 

d'^{v2, s), d^(u3, s) — d'^{v2,s), . . . , d'^{vk,s) — d^(ui, s)). Observe that for every s G 
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supp{ii) the equality v {s) = D ■ v [s) holds, where D is a (fc — 1) x (fc — 1) matrix 



/ -1 ... \ 

-110 ... 

D= -10 10 ... 

-10 1 ... 

V •• 

Therefore, = [A-^i)^ . Since the matrix D is invertible it follows that A-^i and 



A-s2 have the same dimension 

-Tt 



□ 



Theorem 3.13. Let T he a locally-finite connected graph and ^ sequence 

of i.i.d. random T-elements. Assume that the weight function M is totally defined 
and E(^) = {vi, . . . ,Vk}, where fc > 4. // the random walk R associated to vi is 
genuinely (fc — 1) -dimensional, then 

limsupSn = 

n— ^oo 

holds with probability 1. 

Proof. Since is genuinely (fc — l)-dimensional it follows that the subgroup A-j^i 
is isomorphic to Z*^"^ and there exists an invertible matrix D that isomorphically 
maps A-^i C Z''"^ onto Z''"^ Consider a set = {{xi, . . . ,Xk-i) \ Xi > 0}. 

Obviously, v{Tt e llx^ i.o.^ = e M^^^^ i.o.) . 

Let (R^)^ be the random walk on Z'^'^^ induced by D by application of D to 
R . The random walk {R )^ is aperiodic since D maps A-^i onto Z'^^^ and, by 
construction of (i?^)^, 

p(:r^ eM^-^i.o.) =p((i?V e (k';-!)^ i.o.). 

Let 5' be the simple random walk on Z*"'^^. Since (R^)^ and S are both aperi- 
odic random walks on Z*^^^, it follows from the Invariance Principle (Extension of 
Wiener's test) that 

p((:rV e O^i'^)^ i.o.) = P{S £ (K'^-i)^ i.o.). 

Clearly, the set contains a rotated cone and, hence, by Lemma fS.lOi its image 

under an invertible linear transformation D contains a rotated cone too. Now, 

by TheoremlSH P{S € (R^^^)^ i.o.) = 1. Thus, p(r^ G Z^'^ i.o.) = 1 and by 

Lemma 13.61 

P(ui e limsupSn) = 1. 

Finally, it follows from Lemma [3. 121 that for any i = 2, . . . , k the random walk R^ is 
genuinely (fc — l)-dimensional. For any i = 2, . . . , fc we can use the same argument 
as for vi to prove that P{vi G limsup„^oQ §„) = 1. Hence the result. □ 
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3.1.3. The case when random walk is not genuinely (k — 1) -dimensional. The case 
when R is not genuinely {k — l)-dimensional is more comphcated. To answer the 
question whether vi belongs to hmsup„_joQS„ (namely, how often vi G S„), we 
need to analyze how the space V-j^i "sits" in R*^"^. We know that the subgroup 
Aj^i C Z'^^^ is isomorphic to Z™, where m < fc — 1 in the case under consider- 
ation. Therefore, there exists a to x (fc — 1) matrix D which maps the subgroup 
A^i onto Z™ and which is injective onto Aj^i . Furthermore, the mapping D maps 
the subspace V-^i bijectively onto M'". The linear mapping D induces an ape- 
riodic random walk {R )^ on Z™ in a natural way and P(^R^ e (R^-^) 1.0. = 

p(;R^ e (M+"^ n y^i) i.o.) ^ P((i?V e (IR+"^ n Vj^i)^ i.o.). The main problem 
here is to understand the structure of the set (M'p^ n V^i )^ and, to be more precise, 
the structure of the set B-^i = M'p^ n V^i. Clearly B^i is a monoid, i.e., contains 
the trivial element and a sum of any two elements in Bj^i belongs to B-^i . We can 
define dimension of B-^i to be the maximal number of linearly independent vectors 
inS^i. 

Theorem 3.14. Suppose A-^i ~ Z™ and the set B-^i has dimension m. Then 
F{vi G limsup„^^§„) = 1. 

Proof. Since B-^i is a monoid of dimension m it is not hard to see that B^i contains 
an TO-dimensional rotated cone. Since Z) is a linear isomorphism from V-^i onto M™ 
it follows by Lemma 13.101 that [B-^i)^ contains an TO-dimensional rotated cone in 
M™. If S" is a simple random walk in Z'" then P(5' e {B^i)^ i.o.) = 1 and since S 

and {R^)^ are both aperiodic, by the extension of Wiener's test (Invariance Prin- 
ciple), we see that P((#)^ £ (-^-rO^ i-O-) = 1- Hence, p(i?^ e (M^-^) i.o.) = 1 
and by Lemma [3.61 P(vi G limsup^^g^ §„) = 1. 

□ 

Below we investigate under what conditions the subgroup A-^i and the set B-^i 
have the same dimension m. 

Lemma 3.15. Assume that A-^i contains a positive vector. Then A-^i and the set 
R*)^^ n V-j^i have the same dimension. 

Proof. Straightforward. □ 

Lemma 3.16. Assume that /i(ui) ^ 0. Then A-^i and the set R^^^ have the 

same dimension. 

Proof. Observe that ii ^{vi) ^ then A-^i contains the vector (d^(w2, ^'i), • • • , <P'{vkTVi)) 
which has all positive coordinates. Therefore, by Lemma 13.151 the set A-^i and 



n V-j^i have the same dimension. □ 



Corollary 3.17. Let F be a locally-finite connected graph and {^ili^i be a se- 
quence of i.i.d. random F-elements. Assume that the weight function Mjj^(-) is 
totally defined and E(^) = {wi, . . . , Wfe}, where fc > 4. If E(^i) C supp{fi) then 
limsup„_^oQ §„ — E(^i) holds with probability one. 

Proof. Follows from Lemma 13.161 13.151 and 13.141 □ 
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4. Concentration of measure inequalities 

Concentration inequalities are upper bounds on the rate of convergence (in prob- 
ability) of sample (empirical) means to their ensemble counterparts (actual means). 
Chebyshev inequality and ChernofF-Hocffding exponential bounds are classical ex- 
amples of such inequalities in probability theory. In this section, we prove analogues 
of the classical Chebyshev's inequality and Chernoff-Hoeffding like bounds - the con- 
centration of measure inequalities for a graph- (group-)valued random elements. 

4.1. Chebyshev's inequality for graphs/groups. The classical Chebyshev's 
inequality asserts that if ^ is a random variable with E(^^) < oo, then for any 
e > 0, we have 

(23) P(k-E(OI>£)<4' 

where cr^ — Var{£^, see 0. 

Chebyshev discovered it when he was trying to prove the law of large numbers, 
and the inequality is widely used ever since. Chebyshev's inequality is a result 
concerning the concentration of measure, giving a quantitative description of this 
concentration. Indeed, it provides a bound on the probability that a value of a 
random variable ^ with finite mean and variance will differ from the mean by more 
than a fixed number e. In other words, we have a crude estimate for concentration 
of probabilities around the expectation, and this estimate has a big theoretical 
significance. 

The inequality (|23p applied to the sample mean random variable X = where 
•Sn = Ci + . . . + in-, lE(Ci) = m, Var{i{) = cr^, i = 1, . . . , n results in 

2 

(24) P(|^-m| > e) < 

The goal is to prove a similar inequality for a graph- valued random element ^. 

Lemma 4.1. Let pi he a distribution on a locally finite graph T such that M = M*^^-* 
is defined. If for some r G N and vq G V{T) the inequality 

(25) d{vo,s)n{s) ~^fi{vo) <0 
sey(r)\B,,„(r/2) 

holds, then M{u) > M{vo) for any u€V(r)\ By„(r). 

Proof. Indeed, pick any u G V{T) \ By^(r) and put d = d{vQ,u). Then 

Miu)-Mivo)^ J2 {d^{u,s)-d\vo,s))^iis) 

sev(r) 

> d^p{vo) - Y {d^ivo, s) - d^{u, s))fi{s) 
> d'^ li{vo) — 2d d{vQ, s)ij,{s) > d"^ p{vo) ~ 2d d(wo, s)^(s). 

d(vo,s)>d{u,s) seV(r)\S„o(r/2) 

Since c? > r it follows that the last sum is positive. Thus M{u) > M{vo) as required. 

□ 
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Theorem 4.2. Let T be a locally-finite connected graph and a sequence 

of i.i.d. random T -elements. If the weight function A/^j^ is totally defined and 
E(,fi) = {v} for some v G V{T) then there exists a constant C — C(r,^i) > such 
that 

(26) P(§(ei,...,?„)# W) < ^. 

Proof. It follows from the definition of the sample mean-set that 
{§„ ^ {v}} - {3u e ViV) \ {v}, M„(u) < M„{v)}. 

Hence, it is sufficient to prove that P^3m G V(r) \ {v}, Mn{u) < M„(w)^ < ^, 

for some constant C. We do it in two stages. We show that for some vq G V(r) 
and constants r G N, Ci , C2 G M such that v G B^^ (r) and inequalities 

(27) P (bu G B„„ (r) \ {«}, M„(m) < M„(w)^ < ^ 
and 

(28) p(3u G V{r)\BM, Mn{u) < Mn{vo)^ < ^ 

hold. Clearly, for any u,vo,v G V{r) if Mn{u) < Mn{v) then either Mn{u) < 
Mnivo) or Af„(z;o) < Af„(u). It is not hard to see that if we find Ci and C2 
satisfying (|27p and (|28p respectively, then (f26| holds for C = Ci + C2 and the 
theorem is proved. 

First we argue (^5]) . Choose any vq G T^(r) such that /x(t'o) > and r G N such 
that the inequality ((25|) holds. We can choose such r since Af(i)(wo) is defined. 
Observe that the left hand side of the inequality above is the expectation of a 
random variable 77 : y M defined as 77(3) — c?(tio, s)lv(r)\B„Q(r/2)('S) — §luo(^): 
s G ^(r), where l (s) is an indicator function. Since by our assumption M = 
M'-^^ is defined, it follows that cr^(r?) is defined, and, applying Lemma [4. II and the 
Chebyshev inequality with e — |E?7|/2, we obtain 



P\^3u G ViT) \ B^,{t), M„iu) < MM 
<P d(«o,s)Ai„(s)--Ai„(i;o)-Er7 > |Ery|/2j < — ^. 

Hence, inequality (|28l) holds for C2 = (^2(7", uq, /i) = ^^^Tp^- prove (l?7)) we notice 

that for any u G F(r) \ {w}, Af(u) - Af(w) = ^ (d(w,s) - d(?;,s))(d(M,s) + 

sev(r) 

(i(u, s))/i(s), i.e., Af(M) — M(v) is the expectation of a random variable r : y — > M 
defined as 

T„,„(s) = (d(M, s) - d{v, s)){d{u, s) + d{v, s)), s G V{T). 

Furthermore, since Af^j^(-) is defined and for every s G V(T), d(u,s) — d(v,s) < 
d{v,u), it is easy to see that cr^(r„^t,(s)) < 00. Thus, by the Chebyshev inequality 
for the sample average of t„_i,(s), 

p(\Mr,iu)-A'Uv)-{M{u)-M{v))\>e) < "'^^"'^^'^^ 
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holds. Now, if < £ < M{u) - M{v), then 

P ( M„(zi) < Mniv)] < P (\Mn{u) ~ M„(«) - {M(u) - M(v))\ > e 



Finally, we choose s to be iinf{M(w) — M{v) \ u e Byg{r) \ {v}} and using 
cr-additivity of measure we see that inequality (1271) holds for the constant Ci = 

□ 

In fact, one can easily generalize the previous theorem to the following statement. 

Theorem 4.3. Let T be a locally-finite connected graph and {Ci}iZi o, sequence of 

1.1. d. random T -elements. If the weight function M^-^^ is totally defined then there 
exists a constant C = C(r,^i) > such that 

(29) P(§(6,...,e„) 2E(0) < ^- 

4.2. ChernofF-Hoeffding like bound for graphs/groups. Let be indepen- 
dent random variables. Assume that each Xi is almost surely bounded, i.e., assume 
that for every z G N there exists a^, hi G M such that P{xi—Exi € [ai,bi]) = 1. Then 
for Sn = X]"=i ^^'^ fo'^ ^'^y e > we have the inequality (called the Hoeffding's 
inequality) 

VilSn ~ ESn\ > ne) < 2exp 



If Xi are identically distributed then we get the inequality 



— {xi-\-...-\- Xn) - Ea;i 
n 



> e < 2 exp 



2£2 



{h-aY 



Techniques of the previous section can be used to find a similar bound on P ^S(^i , . . . , ^„) % E(^) 
for a sequence of iid graph-valued satisfying some additional assumptions. 

Theorem 4.4. Let T be a locally-finite connected graph and {Ci}iZi sequence of 
i.i.d. random T-elements. If the weight function Mjj(-) is totally defined and /^^^ 
has finite support then for some constant C > 



(30) P(§(ei,...,e„) ^E(O) <0(e- 

Proof. Proof is similar to the proof of Theorem l4.2l We find vq G V^(r), r G N, and 
constants Ci , C2 > such that inequalities 

(31) P(3u G B„„(r) \ {v}, Mn{u) < M„(«)^ < 0(e-^i") 
and 

(32) p(3u G V{T) \ B,„{r), M^{u) < Af„(«o)j < ©(e"^^") 
hold. 

Choose vo G V{T) and r G N exactly the same way as in Theorem 14. 2 1 Note that 
a random variable 77(5) = d{vo, s)lv{r)\B„g{r/2)is) — |l^(,(s) (where s G V(T)) is 
almost surely bounded. Choose a lower and an upper bounds for 77 and denote them 
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by a and b respectively. Now, applying Hoeffding's inequality to rj with e = IIE77I/2 
we obtain 



P[3ue ViV) \ B,,{r), M„(u) < MM 



<Pf d(wo,5)M«(s) - ^Ai«(i'o) -E?? > |Ery|/2) < 2exp 

sey(r)\s„o(r/2) 

Therefore, ^ holds for C2 = 



r 



Et]\ 

7 



2(6- a) 

2(fc-a)^ ■ 

To prove pil) we notice that for any v G E(^) and u G ^(T) \ E(^) we have 
M{u)-M{v) = ^ s) s))(d(u, s) s))^(s), i.e., M(u) - M(w) is 

sev{r) 

the expectation of a random variable t„^„ : 1^ — > R defined as 

Tu,v{s) = {d{u, s) - d{v, s)){d{u, s) + d{v, s)), s e F(r). 

Furthermore, since ^1 has finite support it follows that the random variable T„^t,(s) 
almost surely belongs to [au,v,bu,v]- Thus, by the Hoeffding's inequality for the 
sample average of Tu,vis), 

{bu,v ^u,v)'^ 



2 



P|^|M„(u) - Mn{v) - {M{u) - M{v))\ > £ ) < 2exp 
holds. Now, if < £ < M(u) - M(w), then 



V[m^{u) < M„{v)j < P(^|M„(u) - M„(«) - {M{u) - M{v))\ > e 

Choose £ to be iinf{M(u) - M{v) \ v G E(0, u G B^„{r) \ E(0} and 5 = 
max{&„^t, — a^^v \ v G E(^), u G Byg{r) \ E(^)}. Finally, using cr-additivity of 
measure we see that inequality (pij) holds for the constant Ci = □ 

5. Configurations of mean-sets with applications 

In this section, we discuss several configurations of mean-sets on graphs and, in 
particular, on trees and free groups. First, we make a simple observation stated in 
the lemma below. 

Lemma 5.1. Let T be a connected graph. Then for any v G V{T) there exists a 
measure fj, such that E(/i) = {v}. 



Proof. Indeed, the statement of the lemma holds for the distribution defined by 

1, ii u = v; 
0, otherwise. 

□ 



On the other hand, it is easy to see that not any subset of V{r) can be realized 
as E(/i). For instance, consider a graph as in Figure[TJ Let fiQ = /x(uo), fJ-i = 



V/ Vo V2 



Figure 1. Impossible configuration of centers (gray vertices). 
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= Ai(w2), Mo = M{vo), Ml = M{vi), M2 = M{v2) Then Mi = ^0 + 4/i2, 
Mo — ^1+^2, M2 — 4/ii + /iQ. Clearly, for no values of /iq, /ii, and /i2 both 
inequalities Mo > Mi and Mq > M2 can hold simultaneously (since we can not 
have 2Mo > Mi + M2). Thus, vi and V2 can not comprise E/i. In fact, a tree can 
have only a limited configuration of centers as proved in Proposition 15.81 below. 

Let r be a graph. We say that vo £ V(T) is a cut-point if removing vo from F 
results into a disconnected graph. The same definition holds for any metric space. 
It turns out that existence of a cut-point in F affects configurations of mean-sets. 
The following lemma provides a useful inequality that holds for any metric space 
with a cut-point. 

Lemma 5.2 (Cut-point inequality). Let (F,d) be a metric space and vo a cut point 
in F. // vi, V2 belong to distinct connected components of T \ {vo} then for any 
s G V{T) the inequality 

(33) d{vo,V2){d^vus) ^ d^vo,s)) + d{vo,vi){d^V2,s) - d^vo,s)) >C>0 

holds, where C = C(wo, wi, W2) = d{vo,V2)d{vo,vi){d{vo,vi) -\- d{vo, V2 j)- 

Proof. Denote the left hand side of (|33|) by g{s). There are 3 cases to consider. 
Case 1. Assume that s does not belong to the components of vi and V2- Then 

d{vo,V2){d^{vi,s) -d^{vo,s)) +d{vo,vi){d^{v2,s) -d^{vo,s)) 
= d{vo, V2)d{vo, vi){2d{vo, s) -\- d{vo, vi)) -\- d{vo, vi)d{vo,V2){2d{vo, s) -\- d{vo,V2)) 
= d{vo, V2)d{vo, wi)(4d(wo, s) -\- d{vo, vi) -\- d{vo,V2)) > d{vo, V2)d{vo,vi){d{vo,vi) -\- d{vo, 
and hence ([55)) holds. 

Case 2 . Assume that s belongs to the component of vi . Define 

X = x{s) = d{vi, s) and y = y{s) ~ d{vo, s). 
In this notation we get 

g{s) = g{x,y) = d{vQ,V2){x^ - y"^) + d{vo,vi){2yd{vo,V2) + d^(wo,W2))- 
Dividing by a positive value d{vo,V2), we get 

(7(s)>0 if and only if ^\ ^ x^ - y^ ^ d{vo,vi){2y + d{vo, ^2)) > 0- 

d[vo,V2) 

Now, observe that the numbers x, y, and d{vo,vi) satisfy triangle inequalities 

X + y > d{vo,vi); 
X + d{vo,vi) > y; 
y + d{vo,vi) > X] 

that bound the area visualized in Figure [S] The function of two variables ^^^^'^^) 
attains the minimal value c?^(wo,wi) + d{vo,vi)d{vo,V2) on the boundary of the 
specified area. Hence, the inequality g{s) > d{vo,V2)d{vo,vi){d{vo,vi) +d{vo,V2)) 
holds for any s in the component of vi . 

Case 3. If s belongs to the component of V2 then using same arguments as for 
the previous case one shows that (|33p holds. □ 

Corollary 5.3. Let F be a connected graph, vo a cut-point in F, and vi,V2 belong 
to distinct components of F \ {wq}. Then the inequality 

d{vo,V2){M{vi) - Mivo)) + d{vo,vi){M{v2) - M{vo)) > C > 

holds, where C — C(wo, wi, W2) = d{vo,V2)d{vo, vi){d{vo, vi) + d{vo,V2))- 
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X 



d(vo,vi) 



Figure 2. Area of possible triangle side lengths. 

Proof. Indeed, 

divQ,V2){M{v,) - Mivo)) + d{vQ,vi){M{v2) ~ M{vq)) 

= ^ {d{vo,V2){(f{vi,s) - d^{va, s)) + d{va,vi)[d^{v2,s) - d^{vQ, s)))fi{s) 
sev{r) 

> ^ Cfi{s) — C — d{vo,V2)d{vQ,vi){d{vo,vi) + d{vo,V2)), hy Lemma.\5^ 
sevir) 

□ 

Corollary 5.4 (Cut Point Lemma). Let F be a connected graph, vq a cut-point 
in F. If vi and V2 belong to distinct connected components of F \ {wo}, then the 
inequalities M{vo) > M{vi) and M{vo) > M{v2) cannot hold simultaneously. 

Proof. Assume to the contrary that M(vo) > M{vi) and M{vo) > M{v2) hold 
simultaneously which is equivalent to M{vi) — M{vq) < and M{v2) — M{vq) < 0. 
Then, multiplying by positive constants and adding the inequalities above, we get 
d{vo,V2){M{vi) - M{vo)) + d{vo,vi){M{v2) - M{vo)) < which is impossible by 
Corollarv l5.3l This contradiction finishes the proof. □ 

Corollary 5.5 (Mean-set in a graph with a cut-point). Let vq be a cut-point 
in a graph F and F \ {vq} a disjoint union of connected components Fi, . . . ,Ffe. 
Then for any distribution /L^ on F there exists a unique i = 1 , . . . , fc such that 

EfiCV{T,)U{va}. 

Corollary 5.6 (Mean-set in a graph with several cut-points). Let wi, ...,?;„ be cut- 
points in a graph F and F \ {vi, . . . ,Vn} a disjoint union of connected components 
Fi, . . . , Ffe. Then for any distribution /x on F there exists a unique i — 1, . . . , k such 
that C V{T,) U {wi, . . .,«„}. 

Corollary 5.7. Let Gi and G2 be finitely generated groups and G = Gi * G2 & 
free product of Gi and G2. Then for any distribution /i on G the set E/i is a subset 
of elements of the forms gGi or 5G2 for some element g ^ G. 

Proposition 5.8. Let F be a tree and fi a probability measure on V^(F). Then 
|E/i| < 2. Moreover, if E/i = {u, v} then u and v are adjacent in F. 
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Proof. Observe that any points vi,vo,V2 such that vq is eonnected to vi and V2 
satisiy the assumptions of Cut Point Lemma (Coronarv l5.4p . Assume that vq G E/i. 
At most one of the the neighbors of vq can belong to E/i, otherwise we would have 
3 connected vertices with equal M values which contradicts Cut Point Lemma. 

□ 

Corollary 5.9. Let be a probability distribution on a free group F. Then 
|E/i| < 2. 

In general, the number of central points can be unlimited. To see this, consider 
the complete graph Kn on n vertices and let be a uniform probability distribution 
on V{Kn). Clearly En = V{Kn)- Another example of the same type is a cyclic 
graph Cn on n vertices with a uniform probability distribution n on V{Cn)- Clearly 
E/i = V{Cn)- In all previous examples, the centers in a graph formed a connected 
subgraph. This is not always the case. One can construct graphs with as many 
centers as required and property that distances between centers are very large (as 
large as one wishes). 

6. Computation of mean-sets in graphs 

In this section we discuss computational issues that we face in practice. One 
of the technical difficulties is that, unlike the average value Sn/n for real-valued 
random variables, the sample mean-set §„ = ■ ■ • j Cn) is hard to compute. Let 
G be a group and {^jf^j^ a sequence of random i.i.d. elements taking values in G. 
Several problems arise when trying to compute S„: 

• Computation of the set {M{g) \ g G G} requires 0(|Gp) steps. This is 
computationally infeasible for large G, and simply impossible for infinite 
groups. Hence we might want to reduce the search of a minimum to some 
small part of G. 

• There exist infinite groups in which the distance function | • | is very difficult 
to compute. The braid group Boo is one of such groups. The computation 
of the distance function for Boo is an NP-hard problem, see [55]. Such 
groups require special treatment. Moreover, there exist infinite groups for 
which the distance function | ■ | is not computable. We omit consideration 
of such groups. 

On the other hand, we can try to devise some heuristic procedure for this task. As 
we show below, if the function M satisfies certain local monotonicity properties, 
then we can achieve good results. The next algorithm is a simple direct descent 
heuristic which can be used to compute the minimum of a function /. 

Algorithm 6.1. (Direct Descent Heuristic) 

Input: A graph P and a function / : y(P) — R. 
Output: A vertex v that locally minimizes / on P. 
Computations: 

A. Choose a random v e V^(P). 

B. If V has no adjacent vertex with smaller value of /, then output current v. 

C. Otherwise put v <^ u where u is any adjacent vertex such that f{u) < f{v). 
Go to step B. 
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It turns out that if a function / satisfies certain local properties, then we can 
achieve good results; namely, the proposed algorithm finds the vertex that mini- 
mizes / on r exactly. We say that a function / : V{T) — > R is locally decreasing if 
at any vertex v e ^^(r), such that / does not have minimum at v, there exists an 
adjacent vertex u such that f{u) < f(v). We say that a function / is locally finite 
if for any a, 6 £ R the set fiV(T)) n [a, b] is finite. 

Lemma 6.2. Let T be a graph and f : V{T) — > R a real-valued function that attains 
its minimum on T. If f is locally decreasing and locally finite, then Alaorithm \6.1\ 
for r and f finds the vertex that minimizes f onT. 

Proof. Let v G V{T) be a random vertex chosen by Algorithm 16.11 at Step A. If v 
is a minimum of /, then the algorithm stops with the correct answer v. Otherwise, 
the algorithm, at Step C, chooses any vertex u adjacent to v such that f{u) < f{v). 
Such a vertex u exists, since the function / is locally decreasing by assumption. 
Next, Algorithm 16.11 performs the same steps for u. Essentially, it produces a 
succession of vertices vq, wi, U2, . . . such that = v and, for every i = 0, 1, 2, . . ., 
the vertices Vi,Vi+i are adjacent in F with the property f{vi) > f{vi+i). 

We claim that the constructed succession cannot be infinite. Assume, to the con- 
trary, that the chain vo,vi,V2, ■ ■ ■ is infinite. Let m be the minimal value of / on F. 
Then f{V{r)) n [to, f{v)] is infinite, and, / cannot be locally finite. Contradiction. 
Hence the sequence is finite, and the last vertex minimizes / on V{T). □ 

Lemma 6.3. Let ^ be a distribution on a locally finite graph F such that a weight 
function A/(-) is defined. Then the function M{-) is locally finite on F. 

Proof. Since the function M is non-negative, it suffices to prove that for any b G R+ 
the set M(V{r)) n [0,b] is finite. Let v € E(C)) i-e., v minimizes the value of M, 
and r e N such that < ^M{v) < J2ieB^{r) '^^(^i ^^e proof of Lemma 

12.41 Choose an arbitrary value b e R+ and put a ~ max{2, 6/M(u)}. Then 
one can prove (as in Lemma 12. 4p that for any u E F \ i?„((Q; + 2)r), we have 
M(m) > {a+ l)M{v) > b. Therefore, M(y(F)) n [0, b] C M{B^{{a + 2)r)) and the 
set B,j{{a + 2)r) is finite. □ 

Theorem 6.4. Let fi be a distribution on a locally finite tree T such that a function 
M is totally defined. Then Alaorithm \6.1\ for T and M finds a central point (mean- 
set) of /i on T . 

Proof. Follows from Lemmata 121 El andHH □ 

Note, the function M is not locally decreasing for every graph, and a local 
minimum, computed by Algorithm 16. 1[ is not always a global minimum. 

7. Experiments 

In this section we demonstrate how the technique of computing mean-sets, em- 
ploying the Direct Descent Algorithm 16.11 described in section [51 works in practice 
and produces results supporting our SLLN for graphs and groups. More precisely, 
we arrange series of experiments in which we compute the sample mean-sets of 
randomly generated samples of n random elements and observe a universal phe- 
nomenon: the greater the sample size n, the closer the sample mean gets to the 
actual mean of a given distribution. In particular, we experiment with free groups. 
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in which the length function is easily computable. All experiments are done using 
the CRAG software package, see [7]. 

One of the most frequently used distributions on the free groups is a uniform 
distribution fi^ on a sphere of radius L defined as Sl = {w € F{X) \ \w\ = L}. 
Clearly, Sl is finite. Therefore, we can easily define a uniform distribution on 
it as 




if \w\ = L; 
otherwise. 



The reader interested in the question of defining probabilities on groups can 
find several approaches to this issue in [B]. One of the properties oi is that its 
mean-set is just the trivial element of the free group F{X). Observe also that the 
distance of any element of F{X) to the mean-set is just the length of this element 
(or length of the corresponding word) . 

Table [T] below contains the results of experiments for the distributions fj,io, 
/^20) A*50 on the group F4. The main parameters in our experiments are the rank r 
of the free group, the length L, and the sample size n. For every particular triple 
of parameter values (r,L,n), we perform series of 1000 experiments to which we 
refer (in what follows), somewhat loosely, as series {r,L,n). Each cell in the tables 
below corresponds to a certain series of experiments with parameters {r,L,n). In 
each experiment from the series (r, L, n), we randomly generate n words wi, . . . , w„, 
according to distribution /i^, compute the sample mean-set §„, and compute the 
displacement of the actual center oi from §„. The set §„ is computed using 
Algorithm 16.11 which, according to Theorem 16. 4[ always produces correct answers 
for free groups. Every cell in the tables below contains a pair of numbers {d, N); 
it means that in N experiments out of 1000 the displacement from the real mean 
was d. 



L\n 


2 


4 


6 


8 


10 


12 


14 


16 


MS 


(0,885) 
(1,101) 
(2,13) 
(3,1) 


(0,943) 
(1.55) 
(2,2) 


(0.978) 
(1,22) 


(0.988) 
(1,12) 


(0.999) 
(1,1) 


(0,998) 
(1,2) 


(0.1000) 


(0,999) 
(1,1) 




(0,864) 
(1,117) 
(2,16) 
(3,2) 
(4,1) 


(0.930) 
(1.69) 
(2,1) 


(0.976) 
(1,24) 


(0.993) 
(1,7) 


(0.994) 
(1.6) 


(0,999) 
(1,1) 


(0.1000) 


(0.1000) 


M20 


(0,859) 
(1,116) 
(2,19) 
(3,6) 


(0.940) 
(1,58) 
(2,2) 


(0.975) 
(1,25) 


(0.985) 
(1,15) 


(0.991) 
(1,9) 


(0.1000) 


(0,999) 
(1,1) 


(0,999) 
(1,1) 




(0,872) 
(1,108) 
(2,19) 
(3,1) 


(0.928) 
(1,71) 
(2,1) 


(0.984) 
(1,16) 


(0.991) 
(1,9) 


(0.998) 
(1,2) 


(0,997) 
(1,3) 


(0,998) 
(1,2) 


(0,999) 
(1,1) 



Table 1. The results of experiment for F4. 



By doing experiments for free groups of higher ranks, one can easily observe 
that as the rank of the free group grows, we get better and faster convergence. 
Intuitively, one may think about this outcome as follows: the greater the rank is, 
the more branching in the corresponding Cayley graph we have, which means that 
more elements are concentrated in a ball, and the bigger growth (in that sense) 
causes the better and faster convergence. 



28 



N. MOSINA AND A. USHAKOV 



Acknowledgements. We are grateful to loannis Karatzas for his support and 
his time spent reading this manuscript. The authors also extend their gratitude to 
Persi Diaconis for his suggestions of useful literature sources. In addition, we are 
thankful to Gerard Ben Arous for a discussion about n-dimensional random walks. 

References 

S. Ahmad, Elements aleatoires dans les espaces vectoriels topologiques, Ann. Inst. Henri 
Poincarc 2 (1965), pp. 95-135. 

A. Beck, On the strong law of large numbers. Ergodic Theory, Proceedings of the International 
Symposium, pp. 21-53. Academic Press, New York, 1963. 

A. Beck and D. Giesy, P-uniform convergence and a vector-valued strong laui of large num- 
bers, T. Am. Math. Soc. 147 (1970), pp. 541-559. 

R. N. Bhattacharya and V. Patrangenaru, Large sample theory of intrinsic and extrinsic 
sample means on manifolds - /, Ann. Statist. 31 (2003), pp. 1-29. 
P. Billingsley, Probability and Measure. Wiley-Interscience, 1995. 

A. Borovik, A. Myasnikov, and V. Shpilrain, Measuring sets in infinite groups. Computa- 
tional and Statistical Group Theory, Contemporary Mathematics 298, pp. 21-42. American 
Mathematical Society, 2002. 

CRyptography And Groups (CRAG) C++ Library, available at 
http : / /www . acc . Stevens . edu/ downloads . php 

J. Cuesta and C. Matran, The strong law of large numbers for k-means and best possible nets 
of Banach valued random variables, Probab. Th. Rel. Fields 78 (1988), pp. 523-534. 
P. Dehornoy, Braid-based cryptography. Group theory, statistics, and cryptography. Contem- 
porary Mathematics 360, pp. 5—33. American Mathematical Society, 2004. 
P. Diaconis, Group Representation in Probability and Statistics, Lecture Notes - Monograph 
Series. Institute of Mathematical Statistics, 1988. 

W. Feller, An Introduction to Probability Theory and Its Applications: Volume 2. John Wiley 
&; Sons, New York, 1971. 

M. Frechet, Les elements aleatoires de nature quelconque dans un espace distancie, Annales 
de llnstitut Henri Poincare 10 (1948), pp. 215-310. 

H. Furstenberg, Noncommuting random products. Trans. Amer. Math. Soc. 108 (1963), 
pp. 377-428. 

H. Furstenberg and H. Kesten, Products of random matrices, Ann. Math. Statist. 31 (1960), 
pp. 457-469. 

I. Gelfand, Sur un lemme de la theorie des espaces lineaires., Commun. Inst. Sci. Math, et 
Mecan., Univ. Kharkoff et Soc. Math. Kharkoff, IV. Ser. 13 (1936), pp. 35-40. 
P. Grenander, Probabilities on Algebraic Structures, Dover Books on Mathematics. Dover 
Pubhcations, 2008. 

M. Gromov, Metric Structures for Riemannian and Non-Riemannian Spaces based on 
Structures Metriques des Varietes Riemanniennes . Edited by J. LaFontaine and P. Pansu. 
Birkhauser, 1999. 

S. Huckemann, On the meaning of mean shape, preprint. Available at 
http://arxiv .org/abs/1002.079 5 2010. 

K. Ito and H. P. McKean, Jr., Potentials and the random walk, Illinois J. Math. 4 (1960), 
pp. 119-132. 

A. Karlsson and F. Ledrappier, On Laws of Large Numbers for Random Walks, Ann. Probab. 
34 (2006), pp. 1693-1706. 

D. Kendall, D. Barden, T. Carne, and H. Le, Shape and Shape Theory, Wiley Series in 
Probability and Statistics. Wiley, 1999. 

A. N. Kolmogorov, La transformation de Laplace dans les espaces lineaires, CD. Acad. Sci. 
Paris 200 (1935), pp. 1717-1718. 

A. G. Miasnikov, V. Shpilrain, and A. Ushakov, Group-based Cryptography, Advanced 
Courses in Mathematics - CRM Barcelona. Birkhauser Basel, 2008. 

N. Mosina and A. Ushakov, Mean set attack: Cryptanalysis of Sibert et al. authentication 
protocol, available at |http : //arxiv ■ org/abs/100 6 .4850 2010. 

E. Mourier, Elements aleatoires dan unespace de Banach, Ann. Inst. Henri Poincare 13 
(1953), pp. 159-244. 



SSLN FOR GRAPHS AND GROUPS 



29 



[26] M. Patcrson and A. Razborov, The set of minimal braids is co-NP-complete, J. Algorithms 
12 (1991), pp. 393-408. 

[27] B. J. Pettis, On integration in vector spaces, T. Am. Math. Soc. 44 (1938), pp. 277-304. 
[28] S. Pincus, Strong laws of large numbers for products of random matrices, T. Am. Math. Soc. 
287 (1985), pp. 65-89. 

[29] G. Rubinshtein, On multiple-point centers of normalized measures on locally compact metric 

spaces, Siberian Math. J. 36 (1995), pp. 143-146. 
[30] A. V. Skorohod, Basic Principles and Applications of Probability Theory. Springer, 2004. 
[31] F. Spitzer, Principles of Random Walk. Springer, 2001. 

[32] H. Sverdrup-Thygeson, Strong Law of Large Numbers for Measures of Central Tendency and 
Dispersion of Random Variables in Compact Metric Spaces, Ann. Stat 9 (1981), pp. 141-145. 

[33] R. L. Taylor, Some laws of large numbers for normed linear spaces, Ann. Math. Stat. 43 
(1972), pp. 1267-1274. 

[34] K. Uchiyama, Wiener's test for random walks with mean zero and finite variance, Ann. Prob. 
26 (1998), pp. 368-376. 

[35] W. Woess, Random walks on infinite graphs and groups - a survey on selected topics. Bull. 
London Math. Soc. 26 (1994), pp. 1-60. 

[36] H. Ziezold, Expected figures and a strong law of large numbers for random elements in quasi- 
metric spaces. Trans. 7th Prague Conf. Inf. Theory, Stat. Dec. Punc, Random Processes A, 
pp. 591-602, 1977. 

Department of Mathematics, CUNY/LAGCC. Long Island City, NY, USA 
E-mail address: nmosina@lagcc.cmiy.edu; mosina@niath.columbia.edu 

Department of Mathematics, Stevens Institute of Technology, Hoboken, NJ, USA 
E-mail address: sasha.ushakovSgmail.com 



